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A One-Way Packet Duplication Metric 


Status of This Memo 


This document specifies an Internet standards track protocol for the 
Internet community, and requests discussion and suggestions for 
improvements. Please refer to the current edition of the "Internet 
Official Protocol Standards" (STD 1) for the standardization state 
and status of this protocol. Distribution of this memo is unlimited. 


Copyright Notice 


Copyright (c) 2009 IETF Trust and the persons identified as the 
document authors. All rights reserved. 


This document is subject to BCP 78 and the IETF Trust's Legal 
Provisions Relating to IETF Documents in effect on the date of 
publication of this document (http://trustee.ietf.org/license-info). 
Please review these documents carefully, as they describe your rights 
and restrictions with respect to this document. 


Abstract 


When a packet is sent from one host to the other, one normally 
expects that exactly one copy of the packet that was sent arrives at 
the destination. It is, however, possible that a packet is either 
lost or that multiple copies arrive. 


In earlier work, a metric for packet loss was defined. This metric 
quantifies the case where a packet that is sent does not arrive at 
its destination within a reasonable time. In this memo, a metric for 
another case is defined: a packet is sent, but multiple copies 
arrive. The document also discusses streams and methods to summarize 
the results of streams. 
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Les 


Introduction 


This document defines a metric for one-way packet duplication across 
Internet paths. It builds on the IP Performance Metrics (IPPM) 
Framework document [RFC2330]; the reader is assumed to be familiar 
with that document. 


This document follows the same structure as the document for one-way 
packet loss [RFC2680]; the reader is assumed to be familiar with that 
document as well. 


The structure of this memo is as follows: 
o First, a singleton metric, called Type-P-one-way-packet-arrival- 


count, is introduced to measure the number of arriving packets for 
each packet sent. 


o Then, a singleton metric, called Type-P-one-way-packet- 
duplication, is defined to describe a single instance of packet 
duplication. 


o Next, this singleton metric is used to define samples, Type-P-one- 
way-Packet-Duplication-Poisson-Stream and Type-P-one-way-Packet- 
Duplication-Periodic-Stream. These are introduced to measure 
duplication in a series of packets sent with either Poisson- 
distributed [RFC2680] or periodic [RFC3432] intervals between the 
packets. 


o Finally, statistics that summarize the properties of these samples 
are introduced. 


Requirements Notation 


The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", 
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this 
document are to be interpreted as described in [RFC2119]. 


Although RFC 2119 was written with protocols in mind, the key words 
are used in this document for similar reasons. They are used to 
ensure the results of measurements from two different implementations 
are comparable and to note instances when an implementation could 
perturb the network. 
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.2. Motivation 
When a packet is sent from one host to the other, one normally 
expects that exactly one copy of the packet that was sent arrives at 
the destination. It is, however, possible that a packet is either 
lost or that multiple copies arrive. 
In earlier work, a metric for packet loss was defined [RFC2680]. 
This metric distinguishes between cases where the packet arrives and 
where the packet does not arrive within a reasonable time. In this 
memo, a metric for a third outcome is defined: a single packet is 
sent, but multiple copies arrive. 
As this document describes a Case similar to the one discussed in 
[RFC2680], all considerations from that document on timing and 
accuracy apply. 
A Singleton Definition for One-Way Packet Arrival Count 

1. Metric Name 
Type-P-one-way-packet-arrival-count 

.2. Metrics Parameters 
o src, the IP address of a host 
o dst, the IP address of a host 
o T, the wire time of a packet at the source 
o TO, the maximum waiting time for a packet to arrive at the 

destination. 

.3. Metric Units 
An integer number. 

.4. Definition 


Two packets are considered identical if and only if: 


o Both contain identical information fields (see Section 2.5). The 
recipient thus could take either packet and use the data in an 
application. The other packet does not contain any additional 
information. 
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o Both packets appear to have been sent by one and the same host, to 
one and the same destination. Hosts are identified by their IP 
addresses. 


The value of a Type-P-one-way-packet-arrival-count is a positive 
integer number indicating the number of (uncorrupted and identical) 
copies received by dst in the interval [T, T+TO] for a packet sent by 
src at time T. 


If a packet is sent, but it is lost or does not arrive in the 
interval [T, T+T0], then the metric is undefined. Applications MAY 
report an "impossible" value (for example, -1) to indicate this 
condition instead of undefined. 


If a packet is fragmented during transport and if, for whatever 
reason, reassembly does not occur, then the packet will be deemed 


lost. It is thus not included in the Type-P-one-way-packet-arrival- 
count. 
2.5. Discussion 


This metric counts the number of packets arriving for each packet 
sent. The time-out value TO SHOULD be set to a value when the 
application could potentially still use the packet and would not 
discard it automatically. 


If this metric is used in parallel with the Packet Loss Metric 
[RFC2680], the value of TO MUST be the same for both cases in order 
to keep the results comparable. 


The metric only counts packets that are not corrupted during 
transmission and may have been resent automatically by lower layers 
or intermediate devices. Packets that were corrupted during 
transmission but, nevertheless, still arrived at dst are not counted. 


Clocks do have to be synchronized between src and dst such that it is 
possible to uniquely and accurately determine the interval [T, T+T0] 
at both sides. 


If this metric is used in an active measurement system, the system 
MUST NOT send multiple packets with identical information fields in 
order to avoid that all packets will be declared duplicates. This 
metric can be used inside a passive measurement system as well, using 
packets generated by another source. However, if the source can send 
two identical packets within the interval [T, T+T0], this will be 
incorrectly labeled as a duplicate, resulting in a false positive. 

It is up to the implementor to estimate if this scenario is likely to 
happen and the rate of false positives that is acceptable. 
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The definition of identical information fields is such that two 
packets are considered to be identical if they are sent from the same 
source and contain the same information. This does not necessarily 
mean that all bits in the packet are the same. For example, when a 
packet is replicated and the copies are transferred along different 
paths, the Time to Live (TTL) may be different. The implementation 
MUST specify which fields are compared when deciding whether or not 
two packets are identical. 


In the case of IPv4, these will usually be: version, ihl, 
identification, src, dst, protocol, some or all upper-layer protocol 


data. 


In IPv6, these will usually be: version, next header, source, 
destination, some or all upper-layer protocol data 


Note that the use of the identification field is not present in non- 
fragmented IPv6 packets and may not be sufficient to distinguish 
packets from each even in IPv4, particularly at higher transmission 
speeds 


2.6. Methodology 


The basic technique to measure this metric follows the methodology 
described in Section 2.6 of [RFC2680] with one exception. 


[RFC2680] does not specify that the receiving host should be able to 
receive multiple copies of a single packet, as it only needs one copy 
to determine the metrics. Implementations for this metric should 
obviously be capable of receiving multiple copies. 

2.7. Errors and Uncertainties 
Refer to Section 2.7 of [RFC2680]. 

2.8. Reporting the Metric 
Refer to Section 2.8 of [RFC2680]. 

3. A Singleton Definition for One-Way Packet Duplication 


3.1. Metric Name 


Type-P-one-way-packet-duplication 
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3.2. Metrics Parameters 
o src, the IP address of a host 
o dst, the IP address of a host 
o T, the wire time of a packet at the source 


o TO, the maximum waiting time for a packet to arrive at the 
destination. 


3.3. Metric Units 
An integer number. 
3.4. Definition 
The value of a Type-P-one-way-packet-duplication is a positive 
integer number indicating the number of (uncorrupted and identical) 
additional copies of an individual packet received by dst in the 
interval [T, T+T0] as sent by src at time T. 
If a packet is sent and only one copy arrives in the interval [T, 
T+T0], then the metric is 0. If no copy arrives in this interval, 
then the metric is undefined. Applications MAY report an 
"impossible" value (for example, -1) to indicate this condition. 
3.5. Discussion 
This metric is equal to: 
Type-P-one-way-packet-arrival-count - 1 
This metric is expected to be used for applications that need to know 
duplication for an individual packet. All considerations regarding 
methodology, errors, and reporting from the previous section apply. 
4. Definition for Samples for One-Way Packet Duplication 
4.1. Poisson Streams 


4.1.1. Metric Name 


Type-P-one-way-Packet-Duplication-Poisson-Stream 
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.1.2. Metric Parameters 

o src, the IP address of a host. 
o dst, the IP address of a host. 
o Ts, a time. 


o Tf, a time. Ts and Tf specify the time interval when packets can 
be sent for this stream. 


o TO, the maximum waiting time for a packet to arrive at the 
destination. 


o lambda, a rate in reciprocal seconds. 

.1.3. Metric Units 

A sequence of pairs; the elements of each pair are: 

o T, a time 

o Type-P-one-way-packet-arrival-count for the packet sent at T. 
.1.4. Definition 

Given Ts, Tf, and lambda, we compute a pseudo-random Poisson process 
beginning at or before Ts, with average-rate lambda, and ending at or 
after Tf. Those time values greater than or equal to Ts, and less 
than or equal to Tf are then selected. At each of the times in this 
process, we obtain the value of Type-P-one-way-packet-arrival-count. 
The value of the sample is the sequence made up of the resulting 
(time, duplication} pairs. If there are no such pairs, the sequence 
is of length zero, and the sample is said to be empty. 

.1.5. Methodology 

Refer to Section 3.6 of [RFC2680]. 

.1.6. Errors and Uncertainties 

Refer to Section 3.7 of [RFC2680]. 


.1.7. Reporting the Metric 


Refer to Section 3.8 of [RFC2680]. 
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4.2. Periodic Streams 
4.2.1. Metric Name 
Type-P-one-way-Packet-Duplication-Periodic-Stream 
4.2.2. Metric Parameters 
o src, the IP address of a host. 
o dst, the IP address of a host. 
o Ts, a time. 


o Tf, a time. Ts and Tf specify the time interval when packets can 
be sent for this stream. 


o TO, the maximum waiting time for a packet to arrive at the 
destination. 


o lambda, a rate in reciprocal seconds. 
4.2.3. Metric Units 
A sequence of pairs; the elements of each pair are: 
o T, a time 
o  Type-P-one-way-packet-arrival-count for the packet sent at T. 
4.2.4. Definition 
At time Ts, we start sending packets with a constant-rate lambda, 
until time Tf. For each packet sent, we obtain the value of Type-P- 
one-way-packet-arrival-count. The value of the sample is the 
sequence made up of the resulting (time, duplication} pairs. If 
there are no such pairs, the sequence is of length zero and the 
sample is said to be empty. 
4.2.5. Methodology 
Refer to Section 4.5 of [RFC3432]. 


4.2.6. Errors and uncertainties 


Refer to Section 4.6 of [RFC3432]. 
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4.2.7. Reporting the metric 
Refer to Section 4.7 of [RFC3432]. 
5. Some Statistics Definitions for One-Way Duplication 


Note: the statistics described in this section can be used for both 
Type-P-one-way-Packet-Duplication-Poisson-Stream and Type-P-one-way- 
Packet-Duplication-Periodic-Stream. The application SHOULD report 
which sample was used as input. 


5.1. Type-P-one-way-packet-duplication-fraction 


This statistic gives the fraction of additional packets that arrived 
in a stream. 


Given a Type-P-one-way-Packet-Duplication-Poisson-Stream, one first 
removes all values of Type-P-one-way-Packet-Duplication that are 
undefined. For the remaining pairs in the stream, one calculates: 
(Sum Type-P-one-way-packet-arrival-count/Number of pairs left) - 1 
(In other words, (number of packets received)/ (number of packets sent 
and not lost).) 


The number can be expressed as a percentage. 
Note: this statistic is the equivalent to the Y.1540 IPDR [Y1540]. 
5.2. Type-P-one-way-replicated-packet-rate 


This statistic gives the fraction of packets that was duplicated (one 
or more times) in a stream. 


Given a Type-P-one-way-Packet-Duplication-Poisson-Stream, one first 
removes all values of Type-P-one-way-packet-arrival-count that are 
undefined. For the remaining pairs in the stream, one counts the 
number of pairs with Type-P-one-way-packet-arrival-count greater than 
1. Then, one calculates the fraction of packets that meet this 
criterion as a fraction of the total. (In other words: (number of 
duplicated packets)/ (number of packets sent and not lost).) 


The number can be expressed as a percentage. 


Note: this statistic is the equivalent of the Y.1540 RIPR [Y1540]. 
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Examples 


Consider a stream of 4 packets, sent as: 


o Case 1: (1, 2, 3, 4) 

o Case 22. (1, 1, 2, 27 3) 36 Ay 4) 

ó “Case 3% (ls Ty ay 2) 2r 2r 3% Fr B Bj 46-4) 
Oo Case As (1y Ap typo dy -3y 3 3% 4) 


Case 1: No packets are duplicated in a stream, and both the Type-P- 
one-way-packet-duplication-fraction and the Type-P-one-way-packet- 
replicated-packet-rate are 0. 


Case 2: Every packet is duplicated once, and the Type-P-one-way- 
packet-duplication-fraction is 100%. The Type-P-one-way-replicated- 
packet-rate is 100%, too. 


Case 3: Every packet is duplicated twice, so the Type-P-one-way- 
packet-duplication-fraction is 200%. The Type-P-one-way-replicated- 
packet-rate is still 100%. 


Case 4: Half the packets are duplicated twice and the other half are 
not duplicated. The Type-P-one-way-packet-duplication-fraction is 
again 100%, and this number does not show the difference with case 2. 
However, the Type-P-one-way-packet-replicated-packet-rate is 50% in 
this Case and 100% in case 2. 


However, the Type-P-one-way-packet-duplication-rate will not show the 
difference between cases 2 and 3. For this, one has to look at the 


Type-P-one-way-packet-duplication-fraction. 


Finally, note that the order in which the packets arrived does not 
affect the results. For example, these variations of case 2: 


o Case. 2a: (Ly 1, 2.2, 3, 3 Ay 4) 
o. Case 2b: (1, 2, 3, 4, Ty, 2, 3, 4) 


o Case Zes (ly 2, By Ay Ay 3 27 L) 
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(as well as any other permutation) all yield the same results for 
Type-P-one-way-packet-duplication-fraction and the Type-P-one-way- 
replicated-packet-rate. 


6. Security Considerations 


Conducting Internet measurements raises both security and privacy 


concerns. This memo does not specify an implementation of the 
metrics, so it does not directly affect the security of the Internet 
nor of applications that run on the Internet. However, 


implementations of these metrics must be mindful of security and 
privacy concerns. 


There are two types of security concerns: potential harm caused by 
the measurements and potential harm to the measurements. The 
measurements could cause harm because they are active, and they 
inject packets into the network. The measurement parameters MUST be 
carefully selected so that the measurements inject trivial amounts of 
additional traffic into the networks they measure. If they inject 
"too much" traffic, they can skew the results of the measurement, and 
in extreme cases, cause congestion and denial of service. 


The measurements themselves could be harmed by routers giving 
measurement traffic a different priority than "normal" traffic or by 
an attacker injecting artificial measurement traffic. If routers can 
recognize measurement traffic and treat it separately, the 
measurements will not reflect actual user traffic. If an attacker 
injects artificial traffic that is accepted as legitimate, the loss 
rate will be artificially lowered. Therefore, the measurement 
methodologies SHOULD include appropriate techniques to reduce the 
probability that measurement traffic can be distinguished from 
"normal" traffic. Authentication techniques, such as digital 
signatures, may be used where appropriate to guard against injected 
traffic attacks. 


The privacy concerns of network measurement are limited by the active 
measurements described in this memo. Unlike passive measurements, 
there Can be no release of existing user data. 


7. IANA Considerations 


IANA has registered the metrics defined in this document in the IP 
Performance Metrics (IPPM) Metrics Registry, see [RFC4148]. 
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